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Abstract 

We show that a sequence over a finite field ¥ q of characteristic p is p- 
i— — i automatic if and only if it occurs as a column of the spacetime diagram, with 

eventually periodic initial conditions, of a linear cellular automaton with mem- 
J> ory over ¥ q . As a consequence, the subshift generated by a length p-substitution 

can be realized as a topological factor of a linear cellular automaton. 

o 

1 Introduction 

O 

In a cellular automaton, each cell has a value at each time step, so it is natural to 
consider the sequence of values taken by a given cell at times 0, 1, 2, etc. For a one- 
dimensional cellular automaton, such a sequence of states is a column sequence in 
the two-dimensional spacetime diagram. Some column sequences are quite simple, 
such as the characteristic sequence of powers of 2, which occurs as a column in 
Rule 90 begun from a single black cell. On the other hand, some sequences are 
statistically random, such as the center column of Rule 30. 

A relatively unexplored question is the following. Given a sequence on a hnite 
alphabet, does this sequence occur as a column of a cellular automaton spacetime 
diagram? Without additional restrictions, it is possible to obtain any sequence (for 
example, the base-10 digits of ir) as a column: simply place the sequence in the 
initial condition, and let the cellular automaton be the shift map a. To avoid this 



'Partially supported by an NSERC grant. 



1 



trivial case, we impose the restriction that initial conditions be eventually periodic 
in both directions. 

Wolfram found, by brute-force search, spacetime diagrams containing the char- 



acteristic sequence of squares and the Thue-Morse sequence [Wol02, page 1186 . It 



is also possible to construct spacetime diagrams with more exotic column sequences, 
such as the characteristic sequence of primes |Wol02t page 640 1 . 

In this paper we study p-automatic sequences occurring as columns of cellular 
automaton spacetime diagrams. We assume that p is prime and ¥ q is a finite field 
of characteristic p throughout. Litow and Dumas [LD93] gave several examples of 
cellular automata containing well-known p-automatic sequences as columns. They 
also proved that each column of a linear cellular automaton over ¥ q , begun from 
an initial condition with finitely many nonzero entries, is necessarily p-automatic. 
We use their approach to establish the following characterization of p-automatic 
sequences. (All relevant definitions are in Section [2]) 

Theorem 1.1. A sequence of elements in ¥ q is p-automatic if and only if it is a 
column of a spacetime diagram of a linear cellular automaton with memory over ¥ q 
whose initial conditions are eventually periodic in both directions. 

Furthermore, the proof in each direction is constructive. In particular, there is 
an algorithm to compute the local cellular automaton rule, given a finite automaton 
for the p-automatic sequence. 

As a consequence of Theorem |1.1| we are also able to prove the following. 



Theorem 1.2. If u = (u n ) n >o is p-automatic, then for some q and d > 1, there 



exists a linear cellular automaton <i> : (¥ q ) z — > (¥ q ) z and a subsystem (Y, $) of 



{(¥^) £ , $) such that (X u ,a) is a topological factor of (Y, <£). 

We remark that for each n > 1 a sequence is ^"-automatic if and only if it is p- 
automatic, so Theorems 1 1 . 1 1 and 1.2 also apply to preautomatic sequences. Moreover, 



by injecting the alphabet of a general p-automatic sequence into some ¥ q , we can 
find an image of that sequence, under the injection, as a column of a spacetime 
diagram. 

The proof of Theorem 1.1 appears in Section [3] along with some corollaries. In 



Section [4] we discuss an algorithm that, given a finite automaton for a sequence, 
generates the desired cellular automaton with memory, and we compute several 
examples. In Section [5] we prove Theorem L2 and give conditions that ensure that 



the factor mapping of Theorem 1.2 is an embedding. 
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2 Definitions and notation 



In this section we recall definitions of some terms that we use. Let Sj. = {0, 1, . . . , k — 
1}. If n = X^=o ai ^ % i s t ne standard base-/c representation of n with < < k — 1 
and ai / 0, define {n)k to be the word a^a\ ■ ■ ■ a/. We start with the cumbersome, 
but necessary, definition of a finite automaton that generates an automatic sequence: 

Definition 2.1. A deterministic finite automaton with output (DFAO) is a 6-tuple 
(5, Sfc, 5, so, A, to), where S is a finite set of "states", so G 5 is the initial state, A 
is a finite alphabet, cj : 5 — )• A is the output function, and (5 : 5 x S/. -> 5 is the 
transition function. 

In the symbolic dynamics literature, ui is also known as a coding or a letter-to- 
letter projection. The function (5 extends in a natural way to the domain 5 x SJ; 
for example 5(s,aoai) = 8(S(s,ao),ax). This allows us to feed the automaton with 
base-fc representations of natural numbers: 

Definition 2.2. A sequence (it n )n>o of elements in A is k-automatic if there is a 
DFAO (5, Sfc, 5, so> -4> w) such that u n = w(<5(so, (^)fc)) for all n > 0. 

Example 2.3. The Thue-Morse sequence is the sequence (u n ) n >o = 
0, 1, 1, 0, 1, 0, 0, 1, . . . where u n = if the number of occurrences of 1 in the bi- 
nary representation of n is even and u n = 1 otherwise. The Thue-Morse sequence is 
2-automatic, and it is generated by the following automaton, where the two states 
are labeled with their images under u>. 




Let N := {0, 1, . . .}. If M = Z or M = N, then the space of all M-indexed 
sequences from A is written as A M , and an element in „4 M , a configuration, is 
written R = (i?(m)) me M- (Our configurations will be thought of as rows of a two- 
dimensional array.) Let A be endowed with the discrete topology and A M with the 
product topology; then „4 M is a Cantor space, that is, a zero-dimensional perfect 
compact metric space. If b £ A and m £ M, the clopen sets {R : R(m) = b} 
generate a countable basis for the topology on A M . The (left) shift map a : ^4 M — > 
A M is the map defined as (a(R))(m) = R(m + 1). Given two configuration spaces 
_4. M and B M , we shall use a to refer to the shift map on either of these spaces. Recall 
that if $ : „4 M -)■ £> M , we say that <3? commutes with the shift iiao^ = ^oa. 
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Definition 2.4. A (one-dimensional) cellular automaton is a continuous, a- 
commuting map <I> : A 1, — > A z . 

The Curtis-Hedlund-Lyndon theorem [Hcd69] states that $ is a cellular automa- 
ton if there is some local rule (j) '■ A l+r+1 — > A for some Z > (the left radius of <ft) 
and r > (the right radius of (f> ), such that for all R G A z and all m G Z, 

($(i?))(m) = <£(i?(m - Z), i?(m - Z + 1), . . . , R(m + r)) . (1) 

Conversely, any local rule <^> defines a cellular automaton $ using Identity Q. 

The definition of a cellular automaton with memory is a slight generalisation of 
Definition E! 

Definition 2.5. A (one-dimensional) cellular automaton with memory d is a con- 
tinuous, ^-commuting map : (A d ) z —> A z . 

The Curtis-Hedlund-Lyndon theorem also holds for a cellular automaton with 
memory, so that there is a local rule <f> : (A d ) l+r+1 — > A satisfying ([!]), and conversely 
any such local rule defines a cellular automaton with memory. We shall often use 
the fact that the domain of a cellular automaton with memory d is also (A z ) d . 

Definition 2.6. If 3> : (A d ) z A z is a cellular automaton with memory d, then a 
spacetime diagram for $ with initial conditions Rq, . . . , Rd-i is the sequence (R n )n>o 
where we inductively define R n := <&(R n -d, ■ ■ ■ , Rn-i) for n> d. 

We visualize a spacetime diagram by letting time evolve down the page: the nth 
row R n represents the configuration at time n, and R n (m), the entry on Row n and 
Column m of the spacetime diagram, is the state of the mth cell at time n. Whereas 
each row in an ordinary cellular automaton (with memory 1) is determined by the 
previous row, in a cellular automaton with memory d each row is determined by 
the previous d rows. To be brief, we will often use the term 'cellular automaton' to 
mean 'a spacetime diagram of the cellular automaton'. 

Now suppose that A is the finite field ¥ q . In this case ¥ z and (¥ d ) z are groups, 
with componentwise addition; they are also F g -vector spaces. 

Definition 2.7. We say that the cellular automaton <I> : (F^) z — > with memory 
d is linear if $ is an F g -linear map. 

Thus the Curtis-Hedlund-Lyndon theorem implies that the memory-cZ cellular 
automaton <I> is linear if and only if there exist coefficients fj i G ¥ q for — Z < j < r 
and < i < d - 1 such that ($(i? , . . . , R d -i))(m) = ££o ££ = _, fj,i Ri{m + j) 
for all i?o, . . . , Rd-i G Fj? and m G Z. An example of a linear cellular automaton is 
Rule 90, whose field is F2 and whose local rule is {§{R)){m) = R(m — 1) + R(m + 1); 
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Figure 1: Spacetime diagram of a linear cellular automaton with memory 12 con- 
taining the Thue-Morse sequence as a column. 

this rule has been studied extensively in the literature, as have been linear cellular 
automata in general; the algebraic properties of the local rule yield much theoretic 
structure. 

Example 2.8. Figure [T] shows the first 256 rows of the spacetime diagram of a linear 
cellular automaton with memory 12 over ¥2, where is rendered as a white cell and 
1 is rendered as a black cell. The "center" column, containing the top vertex of the 
triangular region, consists of the Thue-Morse sequence. This column is highlighted 
by rendering as light gray. We compute the local rule for the cellular automaton 
in Section HI 

2.1 Classical results 

We recall some results that we shall use. Recall that we use p to denote a prime 
and q to denote a power of p. Let ¥ q be the field of cardinality q. If (u n ) n >o is 
p-automatic generated using the DFAO (S, T, p , 5, sq, A, u), find q such that \A\ < q. 
By injecting A into ¥ q , we can, and henceforth do, assume that the output function 
oj has range in ¥ q . 

Recall that ¥ q [t], ¥ q (t), and ¥ q ((t)) are the sets of polynomials, rational functions, 
and formal Laurent series respectively with coefficients in ¥ q . Elements of ¥ q ((t)) 
are expressions of the form F(t) = ^2 n>no u n t n , where u n £ ¥ q and no G We can 
also define these sets in several variables, for example ¥ q ({t,x)). The Laurent series 
F(t) is algebraic over¥ q {t) if there exists a nonzero polynomial P(t,x) G ¥ q [t, x] 
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such that P(t, F(t)) = 0. Finally we define the q-kernel of a sequence ( , u n ) n >o to be 
the collection of sequences {(u q k n+r ) n >o '■ k > 0, < r < q k — 1}. The equivalence 
of Statements 1 and 2 in Theorem |2.9| was shown first by Christol, for p = 2, and 
then more generally in [CKMFR80]; the equivalence of Statements 2 and 3 dates 
back to Eilenberg |Eil74| . 

Theorem 2.9. Let (u n ) n >o be a sequence of elements in ¥ q . The following are 
equivalent: 

1. F(t) = J2n>o Un ^ n ^ s algebraic over¥ q (t). 

2. The q-kernel of (ii n )n>o is finite. 

3. (u n ) n >o is q-automatic. 

We can also define a formal series E(t, x) = Yl n >n Smez a n,mt n x m in two vari- 
ables t and x. In this case the diagonal is the formal Laurent series 

n>no 

in one variable. Similarly we define the mth column of E(t, x) to be 

V a t n 

n>no 

We say that E(t, x) is a rational series if there exist polynomials Q(t, x) and 
P(t,x) such that P(t,x)E(t,x) = Q(t,x). The following result is due to Fursten- 
berg |Fur67j : 

Theorem 2.10. For a Laurent series F(t) G F 9 ((i)) to be algebraic over¥ q {t), it is 
necessary and sufficient that it is the diagonal of a rational Laurent series E(t,x). 

The necessary direction of Furstenberg's theorem follows from the following two 
propositions in |Fur67j . which we will use in Section [3j 

Proposition 2.11. Suppose that the Laurent series F(t) = J2 n >n c ^ n € F 9 ((i)) is 
algebraic over ¥ q (t). Then there exists r* > uq, m > 0, and a polynomial 

P*(t,x) = A* {t)x + A\(t)x v + ■■■ + A* m {t)x pm + B*(t), 

with A*(t),B*(t) € ¥ q [t] and A%(t) not divisible by t, such that F(t) = R*(t) + 
f'G*{t), R*(t) = Zn= no cnt n , and P*(t,G*(t)) = 0. 

Denote P x (t,x) = |f (t,x). 

Proposition 2.12. Suppose that the series G(t) = J2n>i c ^ n £ ^?((^)) satisfies 
P(t,G(t)) = 0, where P(t,x) G ¥ g [t,x] and P x (0,0) ^ 0. Then G(t) is the diagonal 
of the unique series expansion of 

x 2 P x (tx,x) 
P(tx,x) 
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3 Columns of linear cellular automata 



In this section we prove Theorem Theorems 2.9 and 2.10 are main ingredients. 



To work with spacetime diagrams algebraically, we represent the spacetime dia- 
gram of a cellular automaton as a bivariate series, a technique first used by Martin, 
Odlyzko, and Wolfram [MOW84]. If dn,m is the entry of the spacetime diagram on 
row n 6 N and column m G Z, then the series E(t, x) = Y^ n >o Smez a n,mt n x m en- 
codes the entire cellular automaton evolution from the initial condition Rq . We iden- 
tify the nth row R n of the spacetime diagram with the series R n {x) = YlmeZ a n,m,x m , 
which is the coefficient of t n in E(t, x) = J2n>o Rn{x)t n . 

We begin with the easier direction of Theorem Litow and Dumas |LD93j 
established the case where the initial condition is zero everywhere save the central 
entry and the memory is 1. The proof of the more general statement is similar. 

Theorem 3.1. Let & be a linear cellular automaton rule with memory d over¥ q . 
Let Ro, ■ ■ ■ , Rd-i £ IF? be rows that are eventually periodic in both directions. For 
each m £ Z, the column sequence {R n {m))n>o is p-automatic. 

Proof. The linearity of <I> is equivalent to the existence of Laurent polynomials 
Ci(x), . . . , Cd{x) such that for all n > d 

d 

Rn{x) = Y,^( X ) R n-i(x)- 
i=l 

Let Cq{x) := —1; then Yli=o Ci( x )Rn-i(x) = for all n > d, so 

(Y J c l (x)AE(t,x)=(Y,c l (x)A [y^r^w 

\i=o / \j=o / \j>0 



Y,[Y, c ^ R ^ x n tn 

n>0 \i=0 / 

d-l / d \ 



n=0 \i=0 / 

Each R n _i(x) is a rational expression since it is the sum of two one-sided eventually 



periodic series. Therefore (£? =0 d{x)t % j E{t , x) is a rational expression in t and 
x, and since Yli=o Ci{x)t % is also rational this implies that E{t,x) is also rational. 



Column m of E(t, x) is the diagonal of x m E(tx, x) and therefore by Theorem 2.10 
is algebraic over ¥ q . Hence, by Theorem 2.9, the sequence of entries in column m is 



p-automatic. □ 

For the other direction of Theorem we basically reverse the steps of the pre- 
vious proof. The difficulty arises in obtaining a recurrence Yli=o Ci( x )Rn-i(x) = 
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in which Cq(x) is a (nonzero) monomial. It is necessary that Cq(x) is a monomial so 
that each is a Laurent polynomial, and hence the update rule that determines 
the value of each cell is local. Since the coefficients CAx) come from the denom- 



inator of E(t,x) and the denominator of E(t,x) comes (by Theorem 2.10) from a 
polynomial equation satisfied by F(t), we seek a polynomial P(t,x) where the co- 
efficient of t° is a nonzero monomial in x, and where P(t,F(t)) = 0. The following 
proposition allows us to find such a polynomial. 

Proposition 3.2. Suppose that F(t) = J2n>o u n^ n 6 is algebraic over¥ q (t). 

Then there exist G(t) S ¥ g ((t)) and P(t, x) E ¥ q [t, x] of the form 

P(t, x) = A (t)x + A^x'P + ■■■ + A m (t)x pm + B{t) 

with Ai(t),B(t) £ ¥ q [t] for0<i<m such that 

1. F(t) = R(t) + t r G(t) for some r > and R(t) G ¥ q [t], 

2. G(0) = 0, 

3. Mo) + o, 

4. B(0) = Ai(0) =0forl<i<m, and 

5. P{t,G{t)) = 0. 



Proof. Proposition 2.11 tells us that for some r* > there exist polynomials 
A*{t),B*(t) € ¥ q [t] such that ^(0) ^ and G*(t) = T, n >o u n+r*t n satisfies 
P*{t,G*(t)) = 0, where 

P*(t,x) = A* (t)x + Af(t)x p + ■■■ + A* m (t)x pm + B*(t). 

Let r = r* + 1 and R{t) = u + u\t H h Urt r - Write = it r -i + u r t + iG(t) 

so that G(t) = ^n>i u «+^* n - Expanding P*(t,u r -i + ii r i + tx) and using that 
(a + b) p = a p + b p in characteristic p shows that x = G(t) satisfies 

A* (t)tx + Al(t)t p x p + ■■■ + A* m {t)t pm x pm + B**(t) = 

for some polynomial B**(t). Since each formal power series A*{t)t p ' G{t) pl = 
A*{t)t 2pl (G{t)/t) pl is divisible by t 2 , B**(t) is also divisible by t 2 . Dividing by 
t, let Ai{t) = A*(t)t pt -\ B{t) = B**(t)/t, and 

P(t, x) = A {t)x + Ai{t)x p + ■■■ + A m {t)x pm + B(t). 

One verifies that the conclusions of the proposition are satisfied. □ 

We may now prove the other direction of Theorem |1.1[ 
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Theorem 3.3. If (u n ) n >o is a p-automatic sequence of elements in¥ q , then (u n ) n >o 
occurs as a column of a linear cellular automaton with memory over¥ q , whose initial 
condition rows have finitely many nonzero entries. 



Proof. By Proposition 3.2 there exist G(t) and P(t,x) with Aq{£) and r as de- 
scribed. Using the notation of the proposition, we shall show that the shifted se- 
quence u r +i, u r+ 2, ■ ■ ■ can be found as a column of a spacetime diagram of a linear 
cellular automaton with memory, and then we describe how to reinstate the initial 
terms. Let us write P(t,x) = Yli=o Ci(x)t l where Ci(x) G F g [x]. Conclusion 4 of 



Proposition 3.2 implies that Cq{x) = ^(O)^, and Conclusion 3 is that ^4o(0) 7^ 0. 
Conclusions 2, 5, and 3 of Proposition |3.2| imply that the conditions of Proposi- 



tion 



2.12 



are met. Therefore G(t) is the diagonal of x p^x)^ • ^ f°U ows that G{t) is 
Column —2 of pT^jx ■ Since the coefficient of t° in P(t,x) is the monomial Aq(0)x, 
write P(t, x) = Aq(0)x + tQ{t, x) where Q(t, x) S ¥ q [t, x]. Then expand 



P x (t,x) P x (t,x) 1 Px(t,X 



P(t,x) A (0)x i I A (0)x x 

v ' ' uw i+ A (0)x V ' n>0 v "w / „> 

as a series in t. Each R n (x) is a Laurent polynomial. 

Create a two-dimensional array where the entry in Row n £ N and Column 



m G Z is the coefficient of x m in R n (x). By Proposition 2.12, Column —2 of this 
array consists of the sequence 0, u r +i, u r +2, • • • • It remains to show that the array 
is the spacetime diagram of a cellular automaton with memory and to then restore 
the terms uq,u±, . . . , u r . 

Since the series ^2 n>0 R n (x)t n is rational, the sequence (R n (x)) n >o satisfies a 
linear recurrence with coefficients Cq(x), . . . , Cd(x). Namely, multiplying both sides 
by P(t,x) gives 

P x (t,x) = Y,Ci(x)t i ^Rj(xW = E ( E Ci^R^x)] t n 

i=0 j>0 n>0 \i+j=n J 

d / n \ / d \ 

n=0 \i=0 / ra><2+l \i=0 / 

and since P x (t,x) is a polynomial with deg t P x (t,x) < d, we have 
Yli=o Ci( x )Rn-i{x) = for all n > d + 1. Solving for R n (x) gives 

for all n > d+ 1, where each is a Laurent polynomial in x. Therefore the coeffi- 
cient of x m in R n (x) depends only on the coefficients of x m + 1 ~ max i degd(x) ^ ^ x m+1 
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in R n -i(x), . . . , R n -d{x). In particular, each entry of the two-dimensional array is 
computed by the same local rule. Therefore the coefficients of R\{x), Rz^x), . . . 
form the rows of a spacetime diagram of a cellular automaton with memory d, 
where the first d rows consist of initial conditions and rows Rd+ii R(i-\-2i ' • • are com- 
puted by the local rule. Remove row Rq, since no other rows depend on it and 
the coefficient of x~ 2 in Rq(x) is rather than u r . Finally, we restore the initial 
r + 1 terms of (u n ) n >o- We do this by redefining Ro(x) := u r x~ 2 and defining 
R^ r (x) := uox~ 2 , Ri- r (x) := u\x~ 2 , . . . , R-i(x) := u r ^\x~ 2 . Then trivially in- 
crease the memory of the local rule from d to d + r + 1 without actually introducing 
dependence of R n on rows R n _^ d +i), . . . , R n -(d+r+i)- Then the local cellular automa- 
ton rule with memory d + r + 1, run from initial conditions R- r , . . . , Rd, produces 
a spacetime diagram where the sequence (u n ) n >o occurs in Column —2. □ 



The construction in the proof of Theorem |3.3| gives us some additional informa- 
tion about the spacetime diagram. For example, since Cq(x) = Aq(0)x has degree 
1, the right radius of the local rule is at most 1. Therefore the left boundary of the 
nonzero triangular region in the spacetime diagram grows with speed at most 1 cell 
per step. 

Additionally, Column —1 is identically 0. This column can be seen, for example, 
immediately to the right of the Thue-Morse column in Figure[l](and helps the reader 
identify the location of the desired sequence in the rest of our diagrams). To see 
that this is the case, factor P(t,x) = (x — G{t))Q(t, x) for some Q(t,x) S F g ((t))[x], 
following the proof of Proposition |2.12| | Fur67] . Then we have 

Px(t,x) _ 1 Q x (t,x) 
P{t,x) x - G(t) Q(t,x) ' 

These two summands contain the entries of the two halves of the spacetime diagram. 
Since Q(0, 0) ^ 0, the exponent of x in each nonzero term of the series ^n^v is non- 
negative. Moreover, the only nonzero term in the series x _q^ = - X^n.>o(x^^)) n 
whose exponent of x is greater than —2 is i, which appears in Rq(x), which we 



removed in the proof of Theorem 3.3 



In Section|4]we discuss an algorithm to generate the polynomial P(t, x) of Propo- 
sition [372] and thus the cellular automaton as constructed in Theorem 13.31 We also 
work through some examples. First though we mention a few corollaries. 

Corollary 3.4. If (u n ) n >o is a p-automatic sequence, then the sequence (ti n )n>o 
is the letter-to-letter projection of a sequence (v n ) n >o which occurs as a column of 
a linear cellular automaton (without memory) whose initial condition is eventually 
periodic in both directions. 



Proof. Theorem |3 . 3| guarantees the existence of a linear rule <3? with memory d+r+l 
such that (n n ) n >o occurs as a column of some spacetime diagram of <3>. Wrap every 
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(d+r+l)-tuple of consecutive cells in each column into a single cell. In other words, 
consider the new alphabet ¥^ +r+1 . The new cellular automaton <I>* : (F^+ r+1 ) z 
(Fg +r+1 ) z , without memory, has the same left and right radius as the old, and has 
a local rule which takes the "central" cell, discards the top entry of that (d + r + 1)- 
tuple, shifts the most recent d + r entries up, and inserts the output of the old 
local rule at the bottom. This construction means that there is some sequence 
(vn)n>o £ (]W+ r+1 ) N which occurs as a column of some spacetime diagram for <3>*, 
and such that if we project v n to its first entry, then that projection of (v n ) n >o is 
(un)n>o- It is straightforward that <3?* is linear. □ 

Corollary 3.5. // (u n ) n >o *s a P- automatic sequence, then the sequence (u n ) n >o 
occurs as a column of a cellular automaton (without memory) whose initial condition 
is eventually periodic in both directions. 



Proof. Corollary 3.4 provides a cellular automaton on the alphabet F^ +r+1 such that 
the sequence in Column —2, when projected onto first entries, is (u n ) n >Q. We modify 
the cellular automaton to implement this projection and produce a column which 
is the sequence (u n ). Dilate the existing spacetime diagram spatially by adding a 
new column between every pair of consecutive existing columns. Adjust the local 
rule correspondingly so that the retained (now "even-indexed") columns emulate 
the original spacetime diagram. In each new ( "odd- indexed" ) column, let the state 
of each cell be the first entry of the cell to its left on the previous step. These two 
cases combine to form a local rule on the alphabet Fg +r+1 UF g , since if the value of a 
cell is in ¥ q then the rules "knows" to perform the coding and otherwise to perform 
a linear rule on tuples. Finally, remove the top row and use the second row as the 
initial condition, since uq is the projection of an entry on the top row and therefore 
is an entry on the second row. □ 



Note that the cellular automaton constructed in Corollary 3.5 is not linear; in 
particular, the alphabet on which it is defined is no longer a group. 
If $ and \E' are two cellular automata with memory d such that 

($CRo, Ri, ■ ■ ■ , Rd-i), Rd-i, ...,Ri) = Ro 

for all Ro,...,Rd-i € A z , we say that $ is invertible. The spacetime diagram 
of an invertible cellular automaton can be evolved backward in time as well as 
forward, just as the spacetime diagram of a cellular automaton whose local rule is 
a bijective function of the leftmost or rightmost dependent cell can be continued 
up the page [Hed69[ IRow06j . Figure [3] shows the spacetime diagram of such an 
automaton. 

Corollary 3.6. If (u n ) n >o is a p-automatic sequence, then for some r > the se- 
quence (u n ) n > r occurs as a column of an invertible cellular automaton with memory. 
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Proof. It suffices to arrange that Cd{x) is a nonzero monomial, since solving 
Yli=o Ci(x)Rn-i{x) = for R n ^(i(x) then gives a linear local rule for each entry 
on row n — d in terms of entries on rows n — d + 1, . . . , n. 

We may assume that (u n )n>o has infinitely many zero terms, since if not then 
some permutation of ¥ q results in a sequence (v n )n>o with infinitely many zero 
terms, and after constructing a spacetime diagram containing (v n ) n > r we can apply 
the inverse permutation to obtain a spacetime diagram containing (u n ) n > r . 

As in the proof of Proposition |3.2[ start with the polynomial 



P*(t, x) = A* (t)x + Al(t)x p + ■■■ + A* m (t)x pm + B*(t) 



where ^4q(0) 7^ 0, whose existence is guaranteed by Proposition 2.11 However, 



instead of letting r = r* + 1 as in the proof of Proposition 3.2 we determine r as 
follows. 

Observe that the polynomial P*(t, u r * +tx) has the same form as P*(t, x) but has 
the property that for each 1 < % < m the coefficient of x pl is more highly divisible by 
t than A* {t). Similarly, P*(t,u r * +t(u r *+i +tx)) has the same form again but with 
coefficients that are even more highly divisible by t. Under this iterative substitution 
x 1— > u n + tx for n = r*, r* + 1, . . . , the exponent of t grows fastest in the coefficient 
of x pm , so there exists r** > r* such that the highest power of t in the polynomial 

P*(t, U r * + U r * +1 t H h U r **f**- r * + f **~ r * +1 x) 

appears only in the coefficient of x pm . Let r > r** + 1 such that u r = 0, which exists 
since ( , u n ) n >o has infinitely many zeros. 



Now resume the proof of Proposition 3.2: The series x = G(t) := X^n>i u n+rt T: 
is a zero of the polynomial 



P*{t, U r * + U r * +1 t + ■■■ + Ur^f- 1 -^ + Of " r * + f~ r * 

and moreover the highest power of t in this polynomial appears only in the coefficient 
of x pm (and not also in the coefficient of x°, as it would have if u r 7^ 0). Therefore 
Cd(x) is a monomial. By dividing by an appropriate power of t, the conclusions of 



Proposition 3.2, and hence Theorem 3.3 are preserved. □ 



If the polynomials A m (t) and B(t) are monomials, then we can also think of time 
as moving to the left and right; we use this idea in Section [5] to prove Corollary |5.8| 

We conclude this section with some open questions suggested by the previous 
results. 



Corollary 3.5 provides an upper bound of q d+r+l + 1^4| for the number of states 
in a cellular automaton spacetime diagram containing a given p-automatic 
sequence as a column, where the alphabet of the sequence is A C ¥ q . Can this 
bound be improved? 
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Each column in the spacetime diagram constructed in Corollary 3.6 is bi- 
infinite. Does every letter-to-letter projection of a bi-infinite fixed point of 
a p-uniform substitution (see Section [5]) occur as a column of a bi-infinite 
spacetime diagram? 

Does there exist a 3-automatic sequence (u n ) n >o on a binary alphabet such 
that (u n ) is not eventually periodic and (u n ) occurs as a column of a (non- 



linear) 2-state spacetime diagram? Theorem 3.1 rules out the possibility of 
the rule being linear over F2 since a sequence which is both 2-automatic and 
3-automatic is eventually periodic by Cobham's theorem. 

• Which fc-automatic sequences (if k is not a prime power) occur as columns of 
cellular automaton spacetime diagrams? 

4 Examples 

Provided that we have an algorithm for generating the required polynomial in Propo- 



sition 3.2, the proof of Theorem 3.3 shows us how to build the required cellular 



automaton. 



An inspection of the proof of Theorem 2.9 yields an algorithm for producing 
an explicit polynomial equation satisfied by F(t) = Yln>o Un ^ n - ^or, §i ven the 
p-DFAO that defines (u n ) n >o, we can build the g-DFAO (S, E 9 , S, sq, F ? , cJ) that 
defines (u n ) n >o- Next, the g-kernel of (u n ) n >o consists of the (/-automatic sequences 
built with the DFAO (S, "S q , S, s,¥ q ,u}), where the initial state s varies over S. If 
the q-kernel of (it n ) n >o contains d elements, let them be generated by initial states 
s\, . . . ,Sd' thus each s, determines a sequence (u^ )n>o m the q-kernel of (u n )n>o- 
We can then write each generating function Fi(t) of (u$)n>o as an explicit linear 
combination of the functions Fi(t q ), . . . , Fd(t q ). We repeat this procedure d times to 
explicitly express each Fj(t ql ), for 1 < j < d and < i < d, as a linear combination, 



over the field ¥ q (x), of the elements in {Fi(t qd+1 ), . . . ,Fd(t qd+1 )}. Now we have a 



linear relationship between F so (t), . . . , F so (t qd ), and this yields a polynomial that 



is almost of the form required by Proposition 2.11 — the polynomial P*(t,x) may 



need to be modified so that A^t) is not divisible by t. Inspection of the proof of 



Proposition 2.11 in [Fur67] shows that this modification can be done mechanically. 
We refer the interested reader to the proof of Theorems 6.6.2 and 12.2.5 in [AS03]. 

As examples, next we compute the cellular automaton rules and initial conditions 
that generate three well-known automatic sequences as columns. 

Example 4.1. First let us compute the cellular automaton for the Thue-Morse 
sequence shown in Figure [l] Christol's theorem gives tx + (l + t)x 2 + (l + t 4 )x 4 = 
satisfied by x = F(t) = J2 n >o u nt n - 
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Apply the proof of Proposition 2.11 to this polynomial. The coefficient of x 1 is 
already nonzero. However, since it is divisible by t we find an appropriate r such that 
replacing a; with ^n=o u ™^ n t r ~ lx an d dividing by common powers of t leaves 
the coefficient of x 1 not divisible by t. In this case r = 2 suffices, so we replace 
x i y ~t~ tx and divide by t 2 . Then x = G*(t) := ^2 n>0 u n +it n satisfies 



x + (i + t)x z + (r + r)x 



o. 



Now apply Proposition 3.2 Replace x with u r -\ + u r t + tx = 1 + It + tx so that 
x = G(t) := X^n>i u n+2t n satisfies P(t, G(t)) = 0, where 



P(t, x) = (t z + i 9 ) + x + (t + t 2 )x z + {t b + t 9 )x 4 



By Proposition 



2.12 



it n+ 2 is the coefficient of x in R n (x) for all n > 1, where 



R n {x) is the coefficient of t n in the series 



n>0 



+ 1 + x i + • 



n>0 



By collecting the terms of P(t, x) by common powers of t, we see that R n {x) satisfies 
the recurrence 

R n {x) = xRn^ix) + Q + i?„- 2 (x) + i 3 V 5 (i) + Q + x 3 ^) R n -9( x ) 

for all n > 10. This recurrence determines a linear cellular automaton rule $ with 
memory 9. Extend the memory tod + r + l = 12 without introducing dependence 



on the earliest r + 1 = 3 rows. Let R^ix) 



UqX 



-2 



0, R-i(x) = u\x 



and Ro(x) = ii2X~ 2 = x~ 2 . Then the sequence (tt n )n>o occurs in Column —2 of 
the spacetime diagram of <3? begun from initial conditions R-2, • • • , R$- Columns —7 
through 12 of rows R-2, • • ■ , R13 appear below, with Column —2 highlighted. 




1 Formally we are making the substitution x = X/n=o u "*™ + ^ T 1 y but as we will be making 
additional substitutions we prefer to be slightly sloppy than overly complicated. 
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Figure 2: Spacetime diagram of a cellular automaton with memory 20 containing 
the Rudin-Shapiro sequence. 



Note that the polynomial we computed from Christol's theorem is not the mini- 
mal polynomial of F(t), since x = F(t) is also a zero of t-\-(l+t 2 )x-\-(l+t+t 2 +t 3 )x 2 . 
Using this polynomial instead produces a different polynomial 

P(t, x ) = (t + t 3 + t 4 ) + (1 + t 2 )x + {t + t 2 + t 3 + t A )x 2 

and hence a different cellular automaton. In fact d = 4 and r = 1, so the memory is 
lowered to 6. 



Example 4.2. In Example 4.1 the coefficient of x l in the polynomial obtained from 
Christol's theorem was nonzero, and this saved some work. Therefore let us work 
out example where none of the steps in the algorithm are trivial. Additionally, we 
go through the construction of the polynomial from Christol's theorem. The Rudin- 
Shapiro sequence is the 2-automatic sequence (u n ) n >o = 0, 0, 0, 1, 0, 0, 1, 0, . . . where 
u n = if the number of (possibly overlapping) occurrences of 11 in the binary 
representation of n is even and u n = 1 otherwise. 



First we apply Theorem 2.9 as follows to construct a polynomial equation satis- 
fied by F(t) = ^2 n>0 u n t n = t 3 +t 6 +t u +t 12 + - ■ ■ . There are |5| = 4 sequences in the 

2-kernel of (u n ): (u n ) = (u 2n ) = (U4n+l), («2n+l) = {ugn+7), («4n+3) = («16n+ll), 

and (usn+3) = (uiQn+3)- Each of the generating functions of these four sequences 
can be broken up into its even- and odd-index terms and rewritten in terms of the 
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original four generating functions: 

F(t) = F^t) :=J2 u nt n = F l (t) 2 + tF 2 (t) 2 

n>0 

F 2 (t) :=^n 2n+1 f = F 1 (t) 2 + tF 3 (f) 2 

n>0 

F 3 (t) := 5> 4n+ 3t n = F A {t) 2 + tF 2 {t) 2 

n>0 

Fi(t) :=J2 u Sn+3t n = F 4 (t) 2 + tF 3 (t) 2 . 



n>0 



Using these four equations iteratively, we may write each F{t 21 ) for < i < \S\ as 
a linear combination of Fj(t 2 51+1 ). The result of doing so is 



i + 1 2 + t l + t* + 1 10 + 1 14 t 16 + 1 18 + 1 20 + 1 28 t 22 + 1 24 + 1 26 + 1 30 t 6 + 1 12 

,4^,8 ( 16 +t 20 +t 28 



1 + ** + «' 
l + t f 
1 





o / 



^l(t 32 )\ 

F 2 (t 32 ) 
F 3 (t 32 ) 



( F W \ 

F(i 2 ) 
F(t 4 ) 
F(t 8 ) 



where 



We have a system of equations in 9 variables (the series Fj(t 32 ) for 1 < j < 4 and 
F(t 2t ) for < i < 4), and performing Gaussian elimination on the corresponding 
5x9 matrix gives, in the bottom row, the coefficients of a polynomial equation 
satisfied by x = F(t), namely 



tV + (1 + t b )x 4 + (1 + t 4 + t s + t 12 )x 8 



0. 



Next we apply Proposition 2.11 Since x 1 does not appear in the polynomial 
equation we have found, extract terms whose power of t is even (which in this case 
is all terms) and raise both sides to the power 1/2. The resulting equation 



t 6 x + (1 + t 6 )x z + (1 + t z + t 4 + t D )x 4 = 

has a nonzero coefficient of x 1 , as desired. To obtain a coefficient of x 1 that is not 
divisible by t, let r = 4, replace x \- > + Ot + Ot 2 + t 3 x, and divide by i 6 . Then 
x = G*(t) := X^ n > ^n+3* n satisfies 

x + (1 + t3 )x 2 + (+ 6 + t 8 + t 10 + t 12 )x 4 = 



Finally, we apply Proposition 3.2 Replace x with 1 + 0t + tx so that x = G(t) 
J2n>i u n+4 tn satisfies P(t,G(t)) = 0, where 



P(t, x) = (t 2 + t b + t 7 + t 9 + t 11 ) + x + (t + t 4 )x 2 + (t 9 + t 11 + i 13 + t lb )x 



f 9 , +11 i +13 i +15\^,4 
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Figure 3: Spacetime diagram showing the beginning of the infinite history of an 
invertible cellular automaton containing the Rudin-Shapiro sequence. 



By Proposition 



2.12 



?x n +4 is the coefficient of x in Rn{x) for all n > 1, where 



n>0 V / \ / 



+ X° I t 4 + xH 5 + 



Moreover, R n (x) satisfies the recurrence 



R n (x) = xR n -i(x) + -i? n _ 2 (x) + xR n - 4 (x) + -R n - 5 ( x ) + -R n _ 7 (x) 
x xx 



1 



1 



X 



+ - + X 6 ) R n -g{x) + - + X 6 ) Rn-ll(x) + X 6 Rn- 13 (x) + X^Rn-l^x) 



for all n > 16. Therefore the cellular automaton rule has memory 15, which 
we increase to 20 to reinstate the initial rows. The first 256 rows of the resulting 
spacetime diagram appear in Figure [2j 

Note that since u r happened to be 0, the highest power of t in P(t, x) appears 
only in the coefficient of x 4 , and therefore the cellular automaton is invertible in 
accordance with Corollary 3.6 Figure [3] shows the spacetime diagram for rows 
-R_2i5 through R 40 . 



Example 4.3. The Baum-Sweet sequence is the 2-automatic sequence (u n )n>o = 
1, 1,0, 1, 1, 0, 0, 1, . . . where u n = if the binary representation of n contains a block 
of 0s of odd length and u n = 1 if not. (Note we consider the binary representation 
of to be the empty word.) Christol's theorem gives t 2 x + (1 + t 3 + t^x 2 + t 6 x 4 + 
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Figure 4: Spacetime diagram of a cellular automaton with memory 27 containing 
the Baum-Sweet sequence. 



(1 + t 4 )x 8 = satisfied by x = J2 n >o Un ^ n - ^he output of Proposition 
polynomial 



3.2 



is the 



P(t, x) = {t + t i + t i + t I + t Vi + t w + 1 23 ) + x + (t + 1 4 + t 5 )x 2 + t ia x 4 + (t w + t 2a )x«. 

Therefore we have a cellular automaton with memory d + r + l = 23 + 3 + l = 27. 
The first 192 rows appear in Figure |4j 



5 Substitution dynamical systems as factors of cellular 
automata 

In this section we apply Theorem |1.1| to conclude that certain dynamical systems 
arise as factors of cellular automata. First we define some terms. 

Definition 5.1. Let (X,S) and (Y,T) be two dynamical systems. 

1. If X is a closed subset of Y and T(X) C X, then we say that (X,T) is a 
subsystem of (Y, T). 

2. If there exists a homeomorphism ^ : Y — > X with $ o T = 5 o $, we say the 
dynamical systems (X, S) and (Y, T) are topologically conjugate. 

3. If (X, S) is conjugate to a subsystem of (1", T), then we say that (Y, T) embeds 
(X,S). 

4. If there exists a continuous surjective mapping ^ : Y — ?■ X such that So$ = 
f o T, we say (X, 5) is a (topological) factor of (Y, T). 
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Definition 5.2. If u € .4 N , define X u := {o~ n (u) : n G N}. The dynamical system 
(X u ,cr) is called the (one-sided) subshift associated with u. 



Theorem 1.2 Let u be p-automatic. Then (X u ,a) is a factor of a subsystem of 
some linear cellular automaton ((F^) z ,<l>). 



Proof. By Corollary 3.4, u is the image, under a letter-to-letter projection, of a 
sequence v which appears as a column in the space-time diagram, with initial con- 
dition Rq, of a linear cellular automaton <E>. We shall show that (X v ,a) is a factor 
of a subsystem of ((F^+ r+1 ) z , $); the fact that u is a letter-to-letter projection of v 
implies that (X u , a) is a factor of (X v , a). 

Define the map f : {<5> n (R )} n >o -> {^ n (v)}„> as ^($ n (i? )) := o" n (v). Since v 
is a column of the space-time diagram of $ with initial condition Rq, and is defined 
by a local rule, then it is straightforward to see that \T/ is uniformly continuous. We 
claim that the map : {& n (Ro)} — > {c n (v)} extends to a continuous surjection 
* : {<I> n (i?o)} -> -^v satisfying * o $ = cr o xj/. The proof is standard but we 
include it. Fix R G {<J? n (.Ro)} and suppose that $ nk (Ro) ->■ -R- We will show that 
the set {cr nfc (v)} has a unique limit point y, which is independent of the sequence 
(nfc). Given e, the uniform continuity of on the <3?-orbit of Rq implies that we 
can find a 5 such that d(a n (v), cr m (v)) < e/3 whenever d($ n (Ro), <S> m (R )) < 5 
(where d is the metric generated by the topology on the relevant Cantor space). 
Thus if a nk i (v) — > y and a fe ; (v) — > y', then there is an L such that if / > L then 
d(a nk i(v),o- h 'i(v)) < e/3. If L is also large enough so that a nk i(v), cr M(v) are 
e/3 -close to y, y' respectively, then d(y,y') < e. Hence y = y'. 

Now suppose that & mk (Ro) ->■ i?. The proximity of <I> nfc (i?o) and $ mfe (i? ), for 
large fe, implies the proximity of o~ nk (v) and a mk (v) for large fc, which implies that 
the limit point of each of the sets {a nk (v)} and {<7 mfc (v)} is the same; let this limit 
point be y. We can now define *&(R) = y. To see that ^ is continuous, note that if 
R and Rl are close, and ^(Rq) -> R, $<(Ro) -> then for large fc $ nfc (i? ) and 
$ n fc(i?o) are close, which implies that y and y' are close. To see that is surjective: 
if ci nfc (v) ->■ y, then let R be a limit point of ^> nk (R ): then = y. Finally if 

<S> nk (R ) -> i? and a nfe (u) -> y = *(JJ) then 

$ o $(i?) = $ o $(lim$ nfc (i? )) = lim<r n * +1 (u) = cr lim a nk (u) = <r(tf (J?)) . □ 

k k 

We now define a class of well-studied subshifts, and state Cobham's theorem, 
which tells us that these subshifts arise from p-automatic sequences. 

Definition 5.3. Let S be a finite alphabet. A substitution (or morphism) is a map 
t : S — y S + . The map r extends to a map r : 5 + U5 N — > 5 + U5 N by concatenation: 
if a = oi • • • Ofc • • • , then r(a) := r(ai) • • • T(a^) 
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Definition 5.4. Suppose the substitution r is such that there is some k with |r(o)| = 
k for each a G S. In this case we say that r is a length-k substitution (or k-uniform 
morphism). 

Definition 5.5. A fixed point of r is a sequence v = (v n ) n >o G 5 N such that 
r(v) = v. 

Cobham's theorem |Cob72| gives us the relationship between /c-automatic se- 
quences and fixed points of length-/c substitutions: 

Theorem 5.6. A sequence is k-automatic if and only if it is the image, under a 
letter-to-letter projection, of a fixed point of a length-k substitution. 

Dynamicists have extensively studied substitution subshifts — references detail- 
ing some of this work include [Fog02| and |QuelO| . Combining Cobham's theorem 



with Theorem 1.2, we obtain the following. 



Corollary 5.7. Let v be a substitution sequence generated by a length-p substitution. 
Then (X v , a) is a factor of a subsystem of some linear cellular automaton ((Fg) z , $). 



It would be interesting to know whether the factor mapping in Theorem 1.2 
is, in some or all cases, an embedding. This is in principle possible: in |PY10| . 
substitution systems are embedded in subsystems of cellular automata; however the 
cellular automata are tailored for the specific substitution and have no nice algebraic 
or combinatorial structure. We end with an extra condition on the polynomial given 



by Proposition 3.2 which would give an embedding of the substitution subshift into 
a cellular automaton, and leave as an open question whether such a polynomial 
always exists. 



Corollary 5.8. Suppose, using the notation of Proposition \3Jq and Theorem \37i 
that both A m (t) = at d and B(t) = (3t d are monomials of degree d. Then (X u ,a) is 
the letter-to-letter projection of a subshift that can be embedded in a linear cellular 
automaton. 

Proof. Recall that R n {x) = ~Yli=i TJ^)^n-i{x). If A m (t) is a monomial, then 
only one of the polynomials Ci(x), say iji, has the x p ™ term. This means that if 
the left radius of <1> is p m — 1, and <3? = X^=i ^ where <l>j is the cellular automaton 
defined by c$x) ' then other than all <3?j's have radius strictly less than p m — 1. 
Rotating our original spacetime diagram 5$ counter-clockwise by 90 degrees, we see 
a new spacetime diagram for another cellular automaton with memory I + r + 1. 
Similarly, only one of the cellular automata &d will have right radius / = 1, so that 
rotating 5$ by 90 degrees clockwise, we see another spacetime diagram for another 
cellular automaton with memory I + r + 1. This means that if in 5$, the central 
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l+r+1 columns C_/, . . . , C r have the same entries in a large enough block of length L 
starting at locations m\,m2 respectively, then the entries in two rows R mi , R m2 will 



agree in a large central block. Thus if in the proof of Theorem 1.2 we consider Xv 



where V € g l + r + l j s the infinite word defined by the columns C_/, . . . ,C r , then the 



map defined in Theorem 1.2 is a topological conjugacy between ({& n (Ro)}, <&) 
and (Xy,a). Projecting Xv to the appropriate column containing u, the result 
follows. □ 
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